Article 5215

Title of the article

ESTIMATION OF ALGORITHMS FOR CALCULATION OF DISTANCE BETWEEN DNA LINES

Authors

Mel'nikov Boris Feliksovich, Doctor of physical and mathematical sciences, professor, sub-department of applied mathematics and informatics, Togliatti State University (14 Belorusskaya street, Togliatti, Russia), barmaley62@yandex.ru
Pivneva Svetlana Valentinovna, Candidate of pedagogical sciences, associate professor, sub-department of mathematical modeling, Togliatti State University (14 Belorusskaya street, Togliatti, Russia), tlt.swetlana@rambler.ru
Trifonov Maksim Andreevich, Postgraduate student, Togliatti State University (14 Belorusskaya street, Togliatti, Russia), trifonov_max@mail.ru

UDC index

621.317.7

Abstract

Background. It is often necessary to measure the difference, or distance, between two strings (for example, in evolutionary, structural or functional studies of biological sequences). Because mitochondrial DNA sequences are approximately 17,000 symbols long over the alphabet {a, g, c, t}, the authors chose approximate string matching algorithms that compute the distance in polynomial time. When the metrics of the known approximate string matching algorithms were calculated, the results differed. The purpose of the work is to develop methods for the qualitative assessment of the results obtained. Such an assessment will make it possible to choose the most suitable algorithm, which will improve research in various subject areas.
Materials and methods. The research method was based on the theory of triangular norms in metric spaces.
Results. The initial data were obtained from the NCBI databank, from which 30 mitochondrial DNA sequences were chosen at random. Running the comparison algorithms on these 30 sequences yielded qualitative estimates of the metrics.
Conclusions. Using the obtained qualitative estimates of the metrics, the authors determined the best algorithm for comparing the sequences.
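
The abstract names neither the compared algorithms nor the exact assessment procedure. The following is therefore only a minimal sketch under stated assumptions: it takes the Levenshtein edit distance as one example of a polynomial-time distance between strings, and it counts violations of the triangle inequality over all triples of sequences as one possible qualitative indicator of how well a distance function behaves as a metric. The function names `levenshtein` and `triangle_violations`, the squared-distance heuristic and the short sample sequences are hypothetical and are not taken from the article.

```python
from itertools import combinations


def levenshtein(a: str, b: str) -> int:
    """Edit distance by dynamic programming, O(len(a) * len(b)) time."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(
                prev[j] + 1,               # deletion
                cur[j - 1] + 1,            # insertion
                prev[j - 1] + (ca != cb),  # substitution or match
            ))
        prev = cur
    return prev[-1]


def triangle_violations(seqs, dist) -> int:
    """Count triples whose longest side exceeds the sum of the other two.

    Assumption: dist is symmetric, so each pair is evaluated only once.
    """
    n = len(seqs)
    d = {(i, j): dist(seqs[i], seqs[j]) for i, j in combinations(range(n), 2)}
    bad = 0
    for i, j, k in combinations(range(n), 3):
        sides = (d[i, j], d[j, k], d[i, k])
        if 2 * max(sides) > sum(sides):
            bad += 1
    return bad


# Hypothetical toy data; real mitochondrial sequences are about 17,000 symbols long.
sample = ["acgt", "acga", "acaa"]


def squared(a: str, b: str) -> int:
    """A deliberately non-metric heuristic: the squared edit distance."""
    return levenshtein(a, b) ** 2


print(triangle_violations(sample, levenshtein))
print(triangle_violations(sample, squared))
```

In this sketch, a distance function with fewer violating triples on the same set of sequences would be rated as the better-behaved metric. For 30 sequences there are 4,060 triples to check, so the pairwise distance computations dominate the cost.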

Key words

metric evaluation, algorithms, multiheuristic approach.


 

Date created: 06.10.2015 15:11
Date updated: 20.10.2015 15:30